A Distributed Ranking Algorithm for the iTrust Information Search and Retrieval System

Authors

  • Boyang Peng
  • Louise E. Moser
  • P. M. Melliar-Smith
  • Yung-Ting Chuang
  • Isai Michel Lombera
Abstract

The iTrust system is a decentralized and distributed system for the publication, search, and retrieval of information over the Internet and the Web, designed to make it difficult to censor or filter information. In the distributed ranking algorithm for iTrust presented in this paper, a source node that publishes a document indexes the words in the document and produces a term-frequency table for it. A requesting node that issues a query and receives a response uses the URL in the response to retrieve the term-frequency table from the source node. The requesting node then applies a ranking formula to the term-frequency tables retrieved from multiple source nodes to score the documents with respect to its query. Our evaluations demonstrate that the distributed ranking algorithm ranks documents stably and that it counters scamming by malicious nodes.
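
The flow described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not give the ranking formula, so the score used here (the sum of the relative frequencies of the query terms) is a placeholder, not the paper's formula. The function names `build_tf_table`, `score`, and `rank`, and the example URLs, are hypothetical, and the network retrieval of tables is replaced by an in-memory dictionary.

```python
from collections import Counter
import re


def build_tf_table(text):
    """Source-node step: index the words of a document into a term-frequency table."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return Counter(words)


def score(query, tf_table):
    """Requesting-node step: score one document against the query.

    ASSUMPTION: placeholder formula (sum of relative term frequencies of the
    query terms). The paper's actual ranking formula is not given in the abstract.
    """
    total = sum(tf_table.values())
    if total == 0:
        return 0.0
    terms = re.findall(r"[a-z0-9]+", query.lower())
    return sum(tf_table.get(term, 0) / total for term in terms)


def rank(query, tf_tables):
    """Order document URLs by descending score for the query.

    tf_tables maps a document URL to the term-frequency table that the
    requesting node would have retrieved from the corresponding source node.
    """
    return sorted(tf_tables, key=lambda url: score(query, tf_tables[url]), reverse=True)


# Hypothetical tables, standing in for tables retrieved from two source nodes.
tables = {
    "http://a.example/doc1": build_tf_table("censorship resistant search and retrieval"),
    "http://b.example/doc2": build_tf_table("search search search cooking recipes"),
}
ranking = rank("censorship search", tables)
```

With this naive placeholder score, a document that merely repeats one query term can outrank a document that matches every term, which is the kind of manipulation a scam-resistant ranking formula has to guard against.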


Similar resources

An Ensemble-Learning-Based Algorithm for Learning to Rank in Information Retrieval

Learning to rank refers to machine learning techniques for training a model in a ranking task. Learning to rank has been shown to be useful in many applications of information retrieval, natural language processing, and data mining. Learning to rank can be described by two systems: a learning system and a ranking system. The learning system takes training data as input and constructs a ranking ...


An Effective Path-aware Approach for Keyword Search over Data Graphs

Keyword search is known as a user-friendly alternative to structured query languages for retrieving information from graph-structured data. Efficient retrieval of relevant answers to a keyword query and effective ranking of these answers according to their relevance are the two main challenges in keyword search over graph-structured data. In this paper, a novel scoring function is proposed, w...


Investigating the Effects of Stemming on Information Retrieval in the Persian Language

Exploiting language-specific behavior in information retrieval systems can significantly improve the quality of the retrieved results. The part of a word that remains after its affixes are removed is called the stem. Stemming can be used to improve the relevance of the results in an information retrieval system. Different morphological variants of words (plural, past tense…) will be mapped into t...


Trustworthy Distributed Search and Retrieval over the Internet

This paper describes iTrust, a novel distributed search and retrieval system that provides trustworthy access to information over the Internet. Nodes with information to distribute transmit their metadata to nodes that are selected at random from a set of participating nodes. Similarly, nodes seeking information distribute their requests to nodes that are selected at random from the set of part...


Chaotic Genetic Algorithm based on Explicit Memory with a new Strategy for Updating and Retrieval of Memory in Dynamic Environments

Many of the problems considered in optimization and learning assume that solutions exist in a dynamic environment. Hence, algorithms are required that adapt dynamically to the problem's changing conditions and search the new conditions. In most cases, using information from the past allows rapid adaptation after a change. This is the idea underlying the use of memory in this field, which involves key design issue...



Publication date: 2013